Celestin Apprentice 4

home *** CD-ROM | disk | FTP | other *** search

/ Celestin Apprentice 4 / Apprentice-Release4.iso / Languages / Mops 2.7 / Mops source / System source / longMath < prev next >

Wrap

Text File | 1993-06-15 | 7.0 KB | 329 lines | [TEXT/MSET]

\ Long arithmetic. mrh Aug 90. \ Jun 92 32-bit 68000 code moved to main dic. Removed / and /mod from here \ since main dic versions are now 32-bit. \ Sept 92 Revised to support the ANSI standard. \ This file implements double-length (64 bit) addition and subtraction, 32*32->64 multiplication and 64/32->32 division, and versions of */ and */MOD which have a 64-bit intermediate result. \ This behavior is required by the ANSI standard, defined in terms of cells. It may be overkill for Mops which has 32-bit cells, so we don't put it in the nucleus, but provide it here as an option. :code S>D loc move (a6),d0 bmi.s mn ; This is quicker than two EXT's clr.l -(a6) rts mn moveq #-1,d0 push.l d0 ;code :code DNEG neg 4(a6) negx (a6) ;code :code D+ movem (a6)+,d0-d2 add d1,(a6) addx d0,d2 push d2 ;code :code D- movem (a6)+,d0-d2 sub d1,(a6) subx d0,d2 push d2 ;code :code D< loc movem (a6)+,d0-d2 cmp d0,d2 blt.s setTrue bgt.s setFalse cmp (a6),d1 bhi.s setTrue setFalse clr (a6) rts setTrue moveq #-1,d0 move d0,(a6) ;code :code D> movem (a6)+,d0-d2 cmp d0,d2 bgt.s setTrue blt.s setFalse cmp (a6),d1 blo.s setTrue bra.s setFalse ;code :code D= movem (a6)+,d0-d2 cmp d0,d2 bne.s setFalse cmp (a6),d1 bne.s setFalse bra.s setTrue ;code \ The somewhat dreaded multiply routines :code LONGMULT ; Subroutine to do long unsigned multiply. Uses D0-2. loc pop.l d0 move.l (a6),d1 clr.l -(a6) move.w d1,d2 mulu d0,d2 move.l d2,4(a6) move.l d1,d2 swap d2 mulu d0,d2 add.l d2,2(a6) swap d0 move.w d1,d2 mulu d0,d2 add.l d2,2(a6) bcc.s mpy2 addq.w #1,(a6) mpy2 move.l d1,d2 swap d2 mulu d0,d2 add.l d2,(a6) ;code :code UM* ; Unsigned mixed multiply loc bra.s p68k ; NOP'd out if we're on an 020/030 or later pop.l d1 dc.w $4C16,$1400 ; mulu.l (a6),d0:d1 move.l d1,(a6) push.l d0 rts p68k tst.w (a6) ; If both high-order words are zero, bne.s dic[longMult] ; we can do a short multiply. tst.w 4(a6) bne.s dic[longMult] pop.l d0 ; Yes, we can. move.l (a6),d1 mulu d0,d1 move.l d1,(a6) clr.l -(a6) ;code :code M* ; Signed mixed multiply loc bra.s p68k ; NOP'd out if we're on an 020/030 or later pop.l d1 dc.w $4C16,$1C00 ; muls.l (a6),d0:d1 move.l d1,(a6) push.l d0 rts p68k move.l d3,-(a7) ; save D3 tst.l (a6) smi d3 bpl.s tst2nd neg.l (a6) tst2nd tst.l 4(a6) bpl.s domult not.b d3 neg.l 4(a6) domult bsr.s dic[um*] tst.b d3 beq.s done neg.l 4(a6) negx (a6) done move.l (a7)+,d3 ; restore d3 ;code \ Division. : DIV_OVERFLOW 24 ArithErr ; : ZERO_DIV 25 ArithErr ; :code UM/MOD ; Unsigned mixed division. Code lifted ; from yours truly's PDP-11 implementation, ; which I prefer to the original Neon version. loc bra.s ummod ; NOP'd out if we're on an 020/030 or later movem.l (a6)+,d0-d2 ; Divisor to D0, dividend to D1-2 dc.w $4C40,$2401 ; divu.l d0,d1:d2 push.l d1 ; Push remainder push.l d2 ; Push quotient rts ummod tst.l (a6) beq.s dic[zero_div] ; Check for zero divide tst.l 4(a6) ; Top 32 bits of dividend zero? bne.s longdiv move.l (a6)+,(a6) ; Yes - NIP them and call U/MOD (faster) jmp dic[u/mod] longdiv pop.l d2 ; D2 = divisor pop.l d0 move.l (a6),d1 ; D0/1 = dividend cmp.l d2,d0 bhs.s dic[div_overflow] move.l d3,-(a7) moveq #31,d3 loop asl.l #1,d1 roxl.l #1,d0 bcs.s dosub cmp.l d2,d0 blo.s lptest dosub sub.l d2,d0 addq #1,d1 lptest dbra d3,loop move.l (a7)+,d3 move.l d0,(a6) ; Push remainder push.l d1 ; and quotient ;code :code M/MOD ; ( d n -- rem quot ) Signed mixed division bra.s p68k ; NOP'd out if we're on an 020/030 or later movem.l (a6)+,d0-d2 ; Divisor to D0, dividend to D1-2 dc.w $4C40,$2C01 ; divs.l d0,d1:d2 push.l d1 ; Push remainder push.l d2 ; Push quotient rts p68k movem.l d3-d4,-(a7) ; Save regs tst.l (a6) ; We make everything smi d3 ; positive then call um/mod. bpl.s mm1 neg.l (a6) mm1 tst.l 4(a6) smi d4 bpl.s mm2 neg.l 8(a6) negx.l 4(a6) mm2 bsr.s ummod tst.l 4(a6) bmi dic[div_overflow] eor.b d4,d3 ; Set sign of quotient bpl.s mm3 neg.l (a6) mm3 tst.b d4 ; Set sign of remainder - same as dividend, bpl.s rtn ; which is different from original Neon. neg.l 4(a6) ; Yes, this was a bug! rtn movem.l (a7)+,d3-d4 ; Restore regs and return ;code : */MOD >r m* r> m/mod ; : */ */mod nip ; : UMD/MOD { dndL dndH dsr \ quotH -- rem quotL quotH } dndL dndH 0 dsr um/mod -> quotH ( dndL rem1 ) dsr um/mod quotH ; \ FM/MOD and SM/REM are the ANSI division words forcing floored and symmetric \ division respectively. The 680x0 signed division is symmetric, so that \ is, naturally, the Mops default (defined by M/MOD). For FM/MOD we have \ to do a little work. : SM/REM \ ( d n -- rem quot ) m/mod ; \ FM/MOD ( d n -- mod quot ) \ If the signs of the divisor and dividend are the same, the result is \ identical to SM/REM (i.e. M/MOD), as it is if the signs differ but \ the remainder from calling M/MOD is zero. If the remainder is non-zero, \ we need to adjust by subtracting 1 from the quotient, and adding the \ divisor to the remainder. This works whichever way around the signs are. :code FM/MOD loc move.l (a6),d0 move.l 4(a6),d1 eor.l d0,d1 bpl dic[m/mod] ; Signs same - call M/MOD and out. move.l (a6),-(a7) ; Signs differ. Save divisor bsr dic[m/mod] ; Call M/MOD move.l (a7)+,d0 ; Recover divisor to D0 tst.l 4(a6) ; Remainder zero? beq.s out ; Yes - we're finished. subq.l #1,(a6) ; No - do adjustment. add.l d0,4(a6) out ;code \ NumAccumulate ( ud1 digit -- ud2 ) is a vector called by >NUMBER. It \ multiplies ud1 by BASE, then adds the digit. In the nucleus we don't \ implement double-length arithmetic, so we ignore the hi cell of ud1, \ and put zero in the hi cell of ud2. Here we implement a proper \ double-length version. : (NumAcc) { udL udH dig \ prod1H -- ud2 } udL base um* -> prod1H udH base * prod1H + dig 0 d+ ; ' (numAcc) -> NumAccumulate \ # in the nucleus ignores the hi cell of the double operand. Here \ we provide a real double version. : # base umd/mod rot 9 over < IF 7 + THEN $ 30 + hold ; : #S BEGIN # 2dup or NUNTIL ; \ Call initLongMath before using any LongMath words. At present all it \ does is to test what processor we're running on, and patch the words to \ use the 020/030 long mult and div instructions if they exist. This will \ improve the performance significantly. : INITLONGMATH { \ nop -- } processor 2 <= ?EXIT \ Out if 68000/68010 $ 4E71 -> nop nop ['] um* w! nop ['] m* w! nop ['] um/mod w! nop ['] m/mod w! patches_done ; ' initLongMath add: init_actions endload \ Comment out the endload for Neon compatibility. We've used ANSI Forth \ word names, but these are redefined to their Neon equivalents below. \ I hope you don't find these various word names as confusing as I do. : S->D s>d ; : U* um* ; : U/ um/mod ; : M/ m/mod ; : M/MOD umd/mod ;